TR - 2014 - 17 An Overview of NVIDIA Tegra K 1 Architecture

نویسندگان

  • Ang Li
  • Radu Serban
  • Dan Negrut
چکیده

This paperwork gives an overview of NVIDIA’s Jetson TK1 Development Kit and its Tegra K1 architecture (32-bit version). We also compare some critical metrics between the Kepler GPU in Tegra K1 and that used in high-end systems, and highlighed that Tegra K1 is more power efficient. Furthermore, we conducted an experiment which shows how Tegra K1 performed compared with Tesla K40C in a specific application. We found that Tegra K1 can achieve better performance per Watt in around half of the benchmarks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mobile Computational Photography with FCam

In this chapter we cover the FCam (short for Frankencamera) architecture and API for computational cameras. We begin with the motivation, which is flexible programming of cameras, especially of camera phones and tablets. We cover the API and several example programs that run on the NVIDIA Tegra 3 prototype tablet and the Nokia N900 and N9 Linux-based phones. We discuss the implementation and po...

متن کامل

Performance Evaluation Of OFDM Based Wireless

................................................................................................................. II CONTENTS ................................................................................................................. III ABBREVIATIONS ..................................................................................................... VII LIST OF FIGURES ....................

متن کامل

Optimized Deep Neural Networks for Real-Time Object Classification on Embedded GPUs "2279

Convolution is the most computationally intensive task of the Convolutional Neural Network (CNN). It requires a lot of memory storage and computational power. There are different approaches to compute the solution of convolution and reduce its computational complexity. In this paper, a matrix multiplication-based convolution (ConvMM) approach is fully parallelized using concurrent resources of ...

متن کامل

Energy-Aware Real-Time Face Recognition System on Mobile CPU-GPU Platform

The Graphics Processor Unit (GPU) has expanded its role from an accelerator for rendering graphics into an efficient parallel processor for general purpose computing. The GPU, an indispensable component in desktop and server-class computers as well as game consoles, has also become an integrated component in handheld devices, such as smartphones. Since the handheld devices are mostly powered by...

متن کامل

Performance and Power Consumption Characterization of 3D Mobile Games

This paper describes a preliminary study of characterizing performance and power consumption characterization of 3D mobile games. We choose Quake3 and XRace as the game benchmarks and study them on TI OMAP3430, Qualcomm Snapdragon S2, and NVIDIA Tegra 2 (three mainstream mobile System-on-Chip architectures) by selectively disabling different graphics pipeline stages in source code level. Our ch...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014